Rule−based Categorial Analysis of Unprompted Speech − a Cross−language Study
نویسنده
چکیده
In this study, we investigated the influence of language specifics in a cross-language task on the automatic segmentation with a self-learning algorithm for the integration of pronunciation rules. The goal of this paper is to present the linguistic and statistic results of a new method to automatically generate pronunciation rules for automatic segmentation of speech the German MAUSER system. MAUSER is an algorithm which generates pronunciation rules independently of any domain dependent training data either by clustering and statistically weighting self-learned rules according to a small set of phonological rules clustered by categories or by re-weighting “seen” phonological rules. For the generation of pronunciation rules the used algorithm does not require any domain dependent training data. By this method we are able to automatically segment cost-effectively large corpora of mainly unprompted speech.
منابع مشابه
Independent automatic segmentation by self-learning categorial pronunciation rules
The goal of this paper is to present a new method to automatically generate pronunciation rules for automatic segmentation of speech the German MAUSER system. MAUSER is an algorithm which generates pronunciation rules independently of any domain dependent training data either by clustering and statistically weighting self-learned rules according to a small set of phonological rules clustered by...
متن کاملAn Analysis of speeches of Hussein ibn Ali (AS) in the first step toward the incident of Karbala (Departing Medina to Mecca) based on John Searle’s Speech Acts
Linguistic theories can open new doors to historical analysis. This paper seeks to analyze the speeches of Hussein ibn Ali in the first step toward the incident of Karbala which was his departure from Medina to Mecca. The Speech Acts theory which roots in Discourse Analysis focuses on the role of language. It sees speech as an act that brings about actions in this world. Searle introduces only...
متن کاملCategorial grammars used to partial parsing of spoken language
Spoken language understanding is a challenge for the development of Spoken Dialogue Systems. Recognition errors and speech repairs make it impossible to get complete syntactic analysis. Shallow parsing and chunking seem to be efficient in order to start both a robust and precise analysis. This paper describes experiments made with Logus, a spoken understanding system based on incremental methol...
متن کاملA comparative sociopragmatic analysis of wedding invitations in American and Iranian societies and teaching implications
Wedding invitations (WIs), as a uniquely socially and culturally constructed genre, provide a distinct opportunity to compare the sociocultural values of different speech communities as reflected in the textual content and organization of the different moves. Students can be exposed to this genre and its different moves using a genre-based pedagogy. Genre-based ped...
متن کاملAn Inference-rules based Categorial Grammar Learner for Simulating Language Acquisition
We propose an unsupervised inference rules-based categorial grammar learning method, which aims to simulate language acquisition. The learner has been trained and tested on an artificial language fragment that contains both ambiguity and recursion. We demonstrate that the learner has 100% coverage with respect to the target grammar using a relatively small set of initial assumptions. We also sh...
متن کامل